Welcome to Protein Solubility Predictor!

ProSol-Multi is an artificial intelligence-based tool that uses a novel protein sequence encoder to transform raw protein sequences into informative statistical vectors. The encoder captures multi-level correlations among amino acids and their discriminative distribution within each sequence. These vectors are passed to a random forest classifier, which predicts protein solubility. ProSol-Multi significantly outperforms state-of-the-art protein solubility predictors on four public benchmark datasets, in both k-fold cross-validation and independent test set evaluation. The ProSol-Multi web interface lets users perform multi-dimensional analysis of protein sequences, train and optimize the machine learning model from scratch, use pre-trained models for different species to make inferences on new sequences, and download interactive artifacts during the lifetime of the session.
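To give a rough sense of what turning a sequence into a statistical vector can look like, here is a minimal sketch. It is not the ProSol-Multi encoder, whose details are not given here. As a stand-in it assumes two simple, well-known descriptors: amino acid composition (a distribution feature) and gapped residue-pair frequencies at several gaps (a simple form of multi-level correlation). The function names and the `max_gap` parameter are hypothetical.

```python
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def composition(seq):
    """Fraction of each standard amino acid in the sequence (20 values)."""
    seq = seq.upper()
    n = len(seq)
    return [seq.count(a) / n if n else 0.0 for a in AMINO_ACIDS]


def gapped_pair_frequencies(seq, gap):
    """Frequency of residue pairs separated by `gap` residues (400 values).

    gap=0 counts adjacent pairs; pairs containing a non-standard
    residue are skipped.
    """
    seq = seq.upper()
    pairs = [a + b for a, b in product(AMINO_ACIDS, repeat=2)]
    counts = dict.fromkeys(pairs, 0)
    total = 0
    for i in range(len(seq) - gap - 1):
        key = seq[i] + seq[i + gap + 1]
        if key in counts:
            counts[key] += 1
            total += 1
    return [counts[p] / total if total else 0.0 for p in pairs]


def encode(seq, max_gap=2):
    """Concatenate composition with pair frequencies for gaps 0..max_gap.

    Assumed stand-in encoding, not the tool's actual encoder.
    """
    vec = composition(seq)
    for gap in range(max_gap + 1):
        vec.extend(gapped_pair_frequencies(seq, gap))
    return vec


if __name__ == "__main__":
    vector = encode("MKTAYIAKQRQISFVKSHFSRQ")
    print(len(vector))  # 20 composition values + 3 * 400 pair values
```

Fixed-length vectors like these, one per sequence, could then be stacked into a feature matrix and handed to a random forest implementation such as scikit-learn's `RandomForestClassifier`, together with soluble/insoluble labels.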